Figure 2. The addition of thiamin can reverse progress of rat glioma C6 cells toward 
apoptosis induced by the Naegleria agent. Thiamin added at 12 hour intervals reversed 
progress until death of the cells had already occurred on day 4.5. 
Figure 3. An SDS-polyacrylamide gel stained for thiaminase and for protein. The Naegleria agent 
was purified through preparative isoelectric focusing (see Example 3). 
Figure 4. The complete DNA sequence of the coding region of Naegleria gene TTK 
(including the TAA stop codon). The segment of the gene that encodes thiaminase I is 
underlined. 



ATGTCCACTCAACCAAAGACACTCACTGTTGGTCTCTTCCCATATCTTCCTTCTTGGAATGAAAATGGCAACGAAGTTAA

ATTGATCAATTTGATCAAGGATGTTTTGCCAACTCAGGTTTCCGGATATAATATCGAATATACCGAATTTGATTGTTACA

GTGATGCTAGTCTTCAAAGTCTTCCAGATGTTTTCTCAACTGATAGCATTTTCCTTCCATATCTTGTTTCTTTGGGTGGT

GTCAAGAGTTTGGATGAATCATTGGTTCGTGGTGTTACTGGTGATTTGCATAGTTTTGTTTCCTCAAGTGCCTCTGTCAA

TGGTTCCGTTTATGGTTTCCCACAATACTTGTGCTCAAACTTTTTATTGTCCTCACCAAATGGTACTCAACAAGCATCTT

CCCTTTTAGAATTGGCTCAAAAGGTTGGTTATGAACAAATTGTTTATCCAGATGTTGCCTCTTCTAGTTCTTTCACAGTT

TTCGGATTGTATCAACAATTACTCCAATCATCATCATCAGCTGCAGTTGATATCAAGGCCTCTGATCTTCCACAATCTGG

TGACCAAGTCAACAAGGATATCACTCAAAAATATAGAACCATTTTGGATTCAACAGTTGTTGCCTCTCAAAGAGAATATA

TTAACTCTGTAAAGCAAGGTAAACCAATTTCAAACTACTATGTCGGATATAGTGAAAGTATGTGTGAAATTAAGGATATC

ATCAGAGATCAACAATACAATGTTCAACTCATTGGTACCTCTGATAAGCCATACGTTTATACTGATGTTTTGGCTTTGAA

TTCCAATTTGTGTGATGAAAAGCAAAAGGTTGCTGTTGAAGTTATCAAGAATTTATTGACTAATACTTTAGTTTTGGACT

TGTTGGGTCTCGGATTAACTCTCCCAGCCAACAAGAATGGTATTGCTCATTTGGCTAAATCATCAAACTTTTATGCTCAA

TTGAGCCAACAATTCGATGCCAAGGAAAGTGAAGTTAGAGTTTTGAGATGTGTTGACTTTGCTAACAAGGAAGTTAAGAA

TTGTGCTGGTGTCTTGAGACCATTCCTT^

AGGCTAAGAGTGGTCACCCTGGTATGCCAATTGGTATGTCACCAATTGCCTATGTTTTGTGGAAGTTCT^ 

TCTAAGGATGATGTCAATTGGTTGAACAGAGATAGATTTGTTTTGAGTAATGGTCACGGTTGTACATTGCTTTATGCCAT 

GTTGCACCTCACTGATTGTAACTTGAGTTTGGATGATCTCAAGAATTTCAGAAGTTTGCATTCCAAGACTCCTGGTCACC 

CAGAATATGGTCACACTGAAGGTGTTGATGCTACTACTGGTCCATTGGGTCAAGGTGTTTGTAATGCTATTGGTATGGCT 

CTCTCTGAAGCTCACTTGGCTGCTCGTTTCAATAAGGATGGACAAAATATCTTTGATCACCACACCTATGTTTTCCTTGG 

TGATGGTTGTTTGATGGAACGTGTTGCTATGGAAGGTCTCTCATTTGCTGGTCACCAAAAGTTGAACAAGTTGATTGTTT 

TCTATGATGACAATAGTATTACTATTGATGGTAAGACTGAATTGACCTTTACTCAAAATACTCCAGAAGTCATGAGAGGT 

TTTGGATGGCACGTAATTGTTGTCGACAAGGCTGATAATGACTTGGTTGGTATTAAGGAAGCTATTTTGGAAGCTCACAC 

TGTTACTGACAAGCCAATCATGATCGTTTGTAAGACTACAATTGGTTATTCCTCAAAGGTTCAAGGTACTGCTAAGGTTC 

ACGGTTCTCCATTGGGTGCTGATGGATTGAAGAATTTGAAGGAAACTTGTGGTTTCACTGGTAATGATTTCTTCCATGTT 

CCAGAAATTGTCAGAAAGGACTTTGCTACTGTCATTAATAGAAATAGTGAAAAGCTCTCTCAATGGAAGCAAGTTAAATC 

TCCCTATGATACCACTCATGCTACTGAATCCCAACTCCTCCAAAGAATGATTAATCACGAATTGGAAGGTGATGTTATGG 

AAAAGTTGCCAAAATACCTCGAACAAAAGAAGATTGCTACCAGATCTACATCTCAACAAGTTTTGAATGCCATCTATCCA 

CTCATTCCTTCTCTCGTTGGTGGTTCAGCTGACTTGACTCCATCCAACTTGACTGATGTAACTGGATGTCAAGATTTCCA 

ACCAAACAATAGAGTTGGTAGATATATCAGATTTGGTGTCCGTGAACATGCCATGGTTGCTATTGCCAATGGTATTCTCT 

ATCATGGTGTTCTTAGAACCTATGTTGGTACATTCTTGAACTTTG 

^TCTGGTCTTCCAAATATTTATGTTTTCACTCATGACAGTATTGGTCTTGGTCAAGATGGTCCAACTCACOUVCCTGT 

TCAAGTTTTACCAATGTTGATAGCCATTCCAAATCACATTGTTTTCAGACCTGCTGATGGTAGAGAAACCAGTGGTGCTT 

ATOTGTGGGCTGTTCAATCAAAGAAGACTCCATCCTCAATGATTCTTTCTCGTCAAGATTTGCCACAATTGACTCGT^^ 

GATATTTCAAAGGTTGCTTTGGGTGCCTATGTTATCCAAGGTGATGCTACTCCTGATGTTGTCCTTGTTGGTACTGGTTC 

TGAAGTTTCCCTCATGGTTGAAGCTGCTGAAAAGTTGAAGGCTAACCTTAAGGTTAACGTTGTTTCCATGCCAAGTTGGG 

AATTGTTTGTTCGTCAATCAGAAGAATACAGGAAGACTGTCTTCCCAGATGGTATTCCAGTTGTCAGTGCCGAAGCTTCA 

TCAACCTTTGGTTGGACAAGCTTTGCTCACTATGCTGTTGGTATGACTACTTTCGGTGCTAGTGCTGCTGCTGAAGAAGT 

TOACAAACTCCTCAAGATTACCTCAGACAATGTTGCTGAAAAGGCCACCAAATTGGTTACCAAGTATGGTAAGCAAGCTC 
CAAGACTCAGCTTGTCTCTTGTTGGTGAAGAACTCTAA 



Figure 5. The amino acid sequence encoded by the entire Naegleria TTK gene 



LSEAHLAARFNKDGQNIFDHHTYVFLGDGCL^

PEIVRKDFATVINRNSEKLSQWKQVKSPYDTTHATESQLLQRMINHELEGDVMEKLPKYLE



IQDLPQLTGT 



Figure 6. The DNA sequence of the 1068 bp Naegleria thiaminase I segment, as cloned 
in pNB 1+ and expressed from the 3414 base genomic sequence of gene TTK. 



ATGTCCACTCAACCAAAGACACTCACTGTTGGTCTCTTCCCATATCTTCCTTCTTGGAATGAAAATGGCAACGAAGTTAA

ATTGATCAATTTGATCAAGGATGTTTTGCCAACTCAGGTTTCCGGATATAATATCGAATATACCGAATTTGATTGTTACA

GTGATGCTAGTCTTCAAAGTCTTCCAGATGTTTTCTCAACTGATAGCATTTTCCTTCCATATCTTGTTTCTTTGGGTGGT

GTCAAGAGTTTGGATGAATCATTGGTTCGTGGTGTTACTGGTGATTTGCATAGTTTTGTTTCCTCAAGTGCCTCTGTCAA

TGGTTCCGTTTATGGTTTCCCACAATACTTGTGCTCAAACTTTTTATTGTCCTCACCAAATGGTACTCAACAAGCATCTT

CCCTTTTAGAATTGGCTCAAAAGGTTGGTTATGAACAAATTGTTTATCCAGATGTTGCCTCTTCTAGTTCTTTCACAGTT

TTCGGATTGTATCAACAATTACTCCAATCATCATCATCAGCTGCAGTTGATATCAAGGCCTCTGATCTTCCACAATCTGG

TGACCAAGTCAACAAGGATATCACTCAAAAATATAGAACCATTTTGGATTCAACAGTTGTTGCCTCTCAAAGAGAATATA

TTAACTCTGTAAAGCAAGGTAAACCAATTTCAAACTACTATGTCGGATATAGTGAAAGTATGTGTGAAATTAAGGATATC

ATCAGAGATCAACAATACAATGTTCAACTCATTGGTACCTCTGATAAGCCATACGTTTATACTGATGTTTTGGCTTTGAA

TTCCAATTTGTGTGATGAAAAGCAAAAGGTTGCTGTTGAAGTTATCAAGAATTTATTGACTAATACTTTAGTTTTGGACT

TGTTGGGTCTCGGATTAACTCTCCCAGCCAACAAGAATGGTATTGCTCATTTGGCTAAATCATCAAACTTTTATGCTCAA

TTGAGCCAACAATTCGATGCCAAGGAAAGTGAAGTTAGAGTTTTGAGATGTGTTGACTTTGCTAACAAGGAAGTTAAGAA

TTGTGCTGGTGTCTTGAGACCATTCCTT
Figure 7. The encoded amino acid sequence of the expressed Naegleria thiaminase I gene, 
along with its codons, and with residues numbered (nucleotide/amino acid). 

1/1 31/11

ATG TCC ACT CAA CCA AAG ACA CTC ACT GTT GGT CTC TTC CCA TAT CTT CCT TCT TGG AAT
Met ser thr gln pro lys thr leu thr val gly leu phe pro tyr leu pro ser trp asn

61/21 91/31

GAA AAT GGC AAC GAA GTT AAA TTG ATC AAT TTG ATC AAG GAT GTT TTG CCA ACT CAG GTT
glu asn gly asn glu val lys leu ile asn leu ile lys asp val leu pro thr gln val

121/41 151/51

TCC GGA TAT AAT ATC GAA TAT ACC GAA TTT GAT TGT TAC AGT GAT GCT AGT CTT CAA AGT
ser gly tyr asn ile glu tyr thr glu phe asp cys tyr ser asp ala ser leu gln ser

181/61 211/71

CTT CCA GAT GTT TTC TCA ACT GAT AGC ATT TTC CTT CCA TAT CTT GTT TCT TTG GGT GGT
leu pro asp val phe ser thr asp ser ile phe leu pro tyr leu val ser leu gly gly

241/81 271/91

GTC AAG AGT TTG GAT GAA TCA TTG GTT CGT GGT GTT ACT GGT GAT TTG CAT AGT TTT GTT
val lys ser leu asp glu ser leu val arg gly val thr gly asp leu his ser phe val

301/101 331/111

TCC TCA AGT GCC TCT GTC AAT GGT TCC GTT TAT GGT TTC CCA CAA TAC TTG TGC TCA AAC
ser ser ser ala ser val asn gly ser val tyr gly phe pro gln tyr leu cys ser asn

361/121 391/131

TTT TTA TTG TCC TCA CCA AAT GGT ACT CAA CAA GCA TCT TCC CTT TTA GAA TTG GCT CAA
phe leu leu ser ser pro asn gly thr gln gln ala ser ser leu leu glu leu ala gln

421/141 451/151

AAG GTT GGT TAT GAA CAA ATT GTT TAT CCA GAT GTT GCC TCT TCT AGT TCT TTC ACA GTT
lys val gly tyr glu gln ile val tyr pro asp val ala ser ser ser ser phe thr val

481/161 511/171

TTC GGA TTG TAT CAA CAA TTA CTC CAA TCA TCA TCA TCA GCT GCA GTT GAT ATC AAG GCC
phe gly leu tyr gln gln leu leu gln ser ser ser ser ala ala val asp ile lys ala

541/181 571/191

TCT GAT CTT CCA CAA TCT GGT GAC CAA GTC AAC AAG GAT ATC ACT CAA AAA TAT AGA ACC
ser asp leu pro gln ser gly asp gln val asn lys asp ile thr gln lys tyr arg thr

601/201 631/211

ATT TTG GAT TCA ACA GTT GTT GCC TCT CAA AGA GAA TAT ATT AAC TCT GTA AAG CAA GGT
ile leu asp ser thr val val ala ser gln arg glu tyr ile asn ser val lys gln gly

661/221 691/231

AAA CCA ATT TCA AAC TAC TAT GTC GGA TAT AGT GAA AGT ATG TGT GAA ATT AAG GAT ATC
lys pro ile ser asn tyr tyr val gly tyr ser glu ser met cys glu ile lys asp ile

721/241 751/251

ATC AGA GAT CAA CAA TAC AAT GTT CAA CTC ATT GGT ACC TCT GAT AAG CCA TAC GTT TAT
ile arg asp gln gln tyr asn val gln leu ile gly thr ser asp lys pro tyr val tyr

781/261 811/271

ACT GAT GTT TTG GCT TTG AAT TCC AAT TTG TGT GAT GAA AAG CAA AAG GTT GCT GTT GAA
thr asp val leu ala leu asn ser asn leu cys asp glu lys gln lys val ala val glu

841/281 871/291

GTT ATC AAG AAT TTA TTG ACT AAT ACT TTA GTT TTG GAC TTG TTG GGT CTC GGA TTA ACT
val ile lys asn leu leu thr asn thr leu val leu asp leu leu gly leu gly leu thr

901/301 931/311

CTC CCA GCC AAC AAG AAT GGT ATT GCT CAT TTG GCT AAA TCA TCA AAC TTT TAT GCT CAA
leu pro ala asn lys asn gly ile ala his leu ala lys ser ser asn phe tyr ala gln

961/321 991/331

TTG AGC CAA CAA TTC GAT GCC AAG GAA AGT GAA GTT AGA GTT TTG AGA TGT GTT GAC TTT
leu ser gln gln phe asp ala lys glu ser glu val arg val leu arg cys val asp phe

1021/341 1051/351

GCT AAC AAG GAA GTT AAG AAT TGT GCT GGT GTC TTG AGA CCA TTC CTT
ala asn lys glu val lys asn cys ala gly val leu arg pro phe leu
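The codon-by-codon layout of Figure 7 can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code (which agrees with the translation printed in the figure). The helper names `translate` and `codon_start` are illustrative and do not come from the source; only the first 60 nucleotides of the figure are used as input.

```python
# Standard genetic code built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string (whitespace ignored) into one-letter amino acids."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[n:n + 3]] for n in range(0, usable, 3))


def codon_start(aa_index):
    """First nucleotide (1-based) of the codon for a 1-based residue number,
    matching the nucleotide/amino acid labels used in Figure 7."""
    return 3 * (aa_index - 1) + 1


# First row of Figure 7 (nucleotides 1-60, residues 1-20).
first_row = ("ATG TCC ACT CAA CCA AAG ACA CTC ACT GTT "
             "GGT CTC TTC CCA TAT CTT CCT TCT TGG AAT")

print(translate(first_row))   # MSTQPKTLTVGLFPYLPSWN
print(codon_start(11))        # 31, as in the label "31/11"
print(codon_start(356))       # 1066, the last codon of the 1068 bp segment
```

The same function applied to the full 1068 bp segment should yield 356 residues, consistent with the "aa 1-356" range given for N40KAT in Figure 8.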



Figure 8. Comparison of the amino acid sequence of Naegleria thiaminase I (N40KAT), aligned 
with the sequences of Bacillus thiaminase (BSTP) and segments of the transketolases of six 
organisms. The numbering system indicated here is for the yeast transketolase. 



Abbrev.     GenBank   Enzyme and Organism

ScTKT1ct    P23254    Transketolase of Saccharomyces cerevisiae
CpTKT7ct    Q42677    Transketolase of Craterostigma plantagineum
EcTKT2ct    P33570    Transketolase of Escherichia coli
BsTKTct     P45694    Transketolase of Bacillus subtilis
MgTKT       P47312    Transketolase of Mycoplasma genitalium
MjPTK1      Q58092    Transketolase of Methanococcus jannaschii
BSTP        P45741    Thiaminase I precursor from Bacillus thiaminolyticus
N40KAT      -         Thiaminase I of Naegleria gruberi, aa 1-356



ScTKT1ct  ADDVK QLKSKFGFNPDKSFVVPQEVYD-HYQKTILKPGVEANNKWNKLFSEYQKKFP 56
CpTKT7ct  PKEAE ATRKNLGW-PYEPFHVPDDVKK-HWSRHIAE-GAALESAWNAKFAEFQKKFP
EcTKT2ct  EEEVA---LARQKLGW-HHPPFEI PKEIY--HAWDAREK-GEKAQQS WNEKFAAYKKAHP
BsTKTct   KEESK LTKEAYAWTYEEDFYVPSEVYE-HFAVAVKESGEKKEQEWNAQFAKYKEVYP
MgTKT     EVDFQ LFEKRTNT-NFNFFNYPDSIYH-WFKQTVIERQKQIKEDYNNLLISLKDK-P
MjPTK1
BSTP      MS KVKGF I YKPLMVMLALLLWVS PAGAGAAHSDAS SDI TLKVAI YPYVPD PARFQAAVL
N40KAT    MSTQP KTLTVGLFPYL PS WNENGNEVKLI NL I KDVLPT

ScTKT1ct  ELGAELARRLSGQLPANWESKLPTYTAKDSA VATRKLSETVLEDVYNQLPELIGGS 112
CpTKT7ct  EEAADLKS I ITGELPTNWES IFPTYTPENPG LPTRTLSHQILNGLGDVLPGLLGGS
EcTKT2ct  QLAEEFTRRMSGGLPKDWEKTTQKYINELQANPAKIATRKASQOT
BsTKTct   ELAEQLELAIKGELPKDWDQEVPVYE-KGSS LASRAS SGEVLNGLAKKI PFFVGGS
MgTKT     -LFKKFTNWIDSDFQALYLNQLDEKKVAKKD SATRNYLKDFLNQINNPNSNLYCLN
MjPTK1    MVKLSGVYKG MRKGYGETLI ELGKKYENLWLD
BSTP      DQ WQRQEPGVKLEF -TDWDSYS ADPPDDLDV -FVLDSI FLSHFVDAGYLLP -FGSQD
N40KAT    QVSGYNIEY-TEFDCYSDASLQSLPD VFSTDSIFLPYLVSLGGVKSLDESLV



ScTKT1ct  ADLTPSNLTRWKEALDFQPPSSGSGNYSGRYIRYGIREHAMGA IMNGI S AFGANYKPYGG 172
CpTKT7ct  ADLTLSKMAFLKNSGDFQKKSPGE RNVKFG AREHAMG S ICNGLALHSPGLLPYCA
EcTKT2ct  ADLAPSNLTIWKGSVSLKEDPAGN YI HYGVREFGMT A IANGIAHHG-GFVPYTA
BsTKTct   AD LAG S NKTT I KNAGD FTAVD YS G KNFWFGVREFAMGA ALNGMALHG -GLRVFGG
MgTKT     ADVSRS --CFIKIGDDNLHENPCS RNIQIGIREFAMAT IMNGMALHG-GIKVMGG
MjPTK1    ADLSGS TQTAMF AKEFPE RF FNAG VAEQNMI G MAAGLATTG -- KIVFAS
BSTP      IDQAEDVLPFALQGAKRNGEVYGLP-------QI LCTNLLFYRKGDLKIGQVDNI YELYKKIG
N40KAT    RGVTGDLHS FVS S S ASVNGS VYGF P QYLCSNFLLSS PNGTQQ AS - S LLELAQ

. : . . . .|

Catalytic Cys

ScTKT1ct  TFLNFVS-YAAGAVR-LSALSGHPVIWVATHDSIGV-GE DG-PTHQPIET -- LAHF 222
CpTKT7ct  TYFVFTD-YMRAAMR-ISALSKARVLYIMTHDSIGL-GE DG-PTHQPVEH -- LASF
EcTKT2ct  TFLMFVE-YARNAAR-MAALMKARQIMVYTHDSIGL-GE DG-PTHQAVEQ -- LASL
BsTKTct   TFFVFSD-YLRPAIR-LAALMGLPVTYVFTHDSIAV-GE DG-PTHEPVEQ -- LASL
MgTKT     TFLAFAD-YSKPAIR-LGALMNLPVFYVYTHDSYQV-GG DG-PTHQPYDQ -- rLPML
MjPTK1    SFSMFASGRAWEIIRNLVAYPKLNVKIVATHAGITV-GE DG-ASHQMCED -- IAIM
BSTP      TSHSEQIPPPQNKGLLINMAGGTTKASMYLEALIDVTGQYTEYDLLPPLDPLNDKVIRGL
N40KAT    KVGYEQIVYPDVASSSSFTVFGLYQQLLQSSSSAAV DIKASDLPQSGD-QVNK

Peptide A -> ASDLPQSGD-QVNK

*



Figure 8, continued. 



ScTKT1ct  RS L PNI QVWRP ADGN- EVS AAYKNS LESKHTPS 1 I ALS RQNL PQLEGS SIESASKGG 278
CpTKT7ct  RAMPNI LT LRP ADGN - ET AG A YRAAVQNGERP S I L VLARQKL PQ L PGT S IEGVSKGG
EcTKT2ct  RLTPNFSTWRPCDQV-EAAVGWKIAVERHNGPTALIL
BsTKTct   RAMPNLSLIRPADGN-ETAAAWKLAVQSTDHPTALVLTRQNLPTIDQTSEEALAGVEKGA
MgTKT     RAIEWCWRPCDEK-ETCAGFNYGLLSQDQTTVLVLTRQPLKSIDNTD -- SLKTL-KGG
MjPTK1    RAIPNMVVIAPTDYY-HTKNVIRTIAEYKG-PVYVRMPRRDTEIIYENEEEATFEIGKGK
BSTP      RLLINMAGEKPSQYVPEDGDAYVRASWFAQ-GSGRAFIGYSESMMRMG ---DYAEQVRFK
N40KAT    DITQKYRTILDSTVV-ASQREYINSVKQGK-PISNYYVGYSESMCEIK DIIRDQQYN

Peptide B -> TILDSTVV-ASQR

ScTKT1ct  YVLQDVAN PDIILVATGSEVSLSVEAAKTLAAKNIKARWSLPDFFTFDKQPLE- 332
CpTKT7ct  YVISDNSRGGNSKPDVILIGTGSELEIAARAGDELRKEGKKVRWSLVCWELFAEQSEK-
EcTKT2ct  YVLKDSGG KPDIILIATGSEMEITLQAAEKLAGEGRNVRWSLPSTDIFDAQDEE-
BsTKTct   YVVSKSKN--E-TPDALLIASGSEVGLAIEAQAELAKENIDVSWSMPSMDRFEKQSDE-
MgTKT     YILLDRKQ PDLIIAASGSEVQLAIEFEKVLTKQNVKVRILSVPNITLLLKQDEK-
MjPTK1    ILVDG EDLTIIATGEEVPEALRAGEILKENGISAEIVEMATIKPIDEEIIKK
BSTP      PISSSAG QDI PLFYSDWSWSKTAHPELAKKLANVMASADTVEQALRPQADGQ
N40KAT    VQLIGTS DKPYVYTDVLALNSN---LCDEKQKVAVEVIKNLLTNTLVLDLLG -



ScTKT1ct  YRLSVLPDNVPI -MSVEVLATTCWGKYAH QSFGIDRFGASGKAPEVFKFFGFTP 385
CpTKT7ct  YRETVLPSGVTARVSVEAGSTFGWERFIGP-KG--KAVGIDRFGASAPAERLFKEFGITV
EcTKT2ct  YRESVLPSNVAARVAVEAGIADYWYKYVGL-KG--AIVGMTGYGESAPADKLFPFFGFTA
BsTKTct   YKNEVLPADVKKRLAI EMGS SFGWGKYTGL-EG--DVLGIDRFGASAPGETI INEYGFSV
MgTKT     YLKSLFDANSSL-ITIEASSSYEWFCFKKY-VKNHAHLGAFSFGESDDGDKVYQQKGFNL
MjPTK1    SKDFVVTVEDHSIIGGLGGAVAEVIASNGLNKKLLRIGINDVFGRSGKADELLKYYGLDG
BSTP      YPQYLLPARHQV-YEALMQDYPIYSELAQIVNK--PSNRVFRLGPEVRT-WLKDAKQVLP
N40KAT    -LGLTLPANKNG-IAHLAKSSNFYAQLSQ---QFDAKESEVRVLRCVDFANKEV

Peptide C -> SSNFYAQLSQ---QFDAK

ScTKT1ct  EGVAERAQKTI AFYKGDKL I S PLKKAF 412
CpTKT7ct  EAWA- AAKEI C
EcTKT2ct  ENIVAKAHKVLGVKGA
BsTKTct   PNWNRVKALINK
MgTKT     ERLMKI FTSLRN
MjPTK1    ESIAKRIMEEMKKE
BSTP      EALGLTDVSSLAS
N40KAT    KNCAGVLRPFL
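The position of a marked peptide within the Naegleria sequence can be recovered from the alignment rows by removing gap characters before searching. The following is a minimal sketch. The function names `ungap` and `locate` are illustrative and not from the source. The starting residue number 141 for the N40KAT row is taken from the "421/141" label in Figure 7, and the row and Peptide A are copied from the alignment above.

```python
def ungap(row):
    """Drop alignment gaps and spacing, keeping only residue letters."""
    return "".join(ch for ch in row if ch.isalpha())


def locate(peptide, row, first_residue=1):
    """Return (start, end) residue numbers of a peptide within an alignment row,
    or None if the peptide is not found. Numbers are 1-based and inclusive;
    first_residue is the residue number of the first letter in the row."""
    target = ungap(peptide)
    index = ungap(row).find(target)
    if index < 0:
        return None
    start = first_residue + index
    return start, start + len(target) - 1


# N40KAT row containing Peptide A; the row begins at residue 141 (lys, 421/141).
n40kat_row = "KVGYEQIVYPDVASSSSFTVFGLYQQLLQSSSSAAV DIKASDLPQSGD-QVNK"
peptide_a = "ASDLPQSGD-QVNK"

print(locate(peptide_a, n40kat_row, first_residue=141))  # (180, 192)
```

Under these assumptions Peptide A spans residues 180 to 192, which agrees with Figure 7, where the row labelled 541/181 begins "ser asp leu pro" immediately after the alanine at position 180.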



